Exploring MWEs for Knowledge Acquisition from Corporate Technical Documents

نویسندگان

  • Bell Manrique Losada
  • Carlos Mario Zapata Jaramillo
  • Diego A. Burgos
چکیده

High frequency can convert a word sequence into a multiword expression (MWE), i.e., a collocation. In this paper, we use collocations as well as syntactically-flexible, lexicalized phrases to analyze ‘job specification documents’ (a kind of corporate technical document) for subsequent acquisition of automated knowledge elicitation. We propose the definition of structural and functional patterns of specific corporate documents by analyzing the contexts and sections in which the expression occurs. Such patterns and its automated processing are the basis for identifying organizational domain knowledge and business information which is used later for the first instances of requirement elicitation processes in software engineering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corporate Memory: A framework for supporting tools for acquisition, organization and maintenance of information and knowledge

In this paper we describe corporate memory which can support multiple knowledge acquisition, organization and maintenance tools. Memory holds and manages documents and related information and knowledge processed and created by such tools. Tools can work with several types of data such as documents, data in relational database and semantic data. Such diversity of information is needed due to dif...

متن کامل

From Natural Language Documents to Sharable Product Knowledge

A great part of a company's product knowledge is often only available as natural language documents. The disadvantages of this source of information are its informal structure, the lack of actuality and general interdepartmental accessibility. We propose a way to integrate knowledge about a speciic product contained in technical documentation into corporate memory. By analysing e.g. maintenance...

متن کامل

A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet

The focused access to knowledge resources like intranet documents plays a vital role in knowledge management and supports in general the shifting towards a Semantic Web. Ontologies act as a conceptual backbone for semantic document access by providing a common understanding and conceptualization of a domain. Building domain-specific ontologies is a time-consuming and expensive manual constructi...

متن کامل

Automated Acquisition of Multiword Expressions for Robust Deep Parsing

In this presentation, I mainly deal with automated acquisition of Multiword Expressions as a means of enhancing robustness of lexicalised grammars used in robust deep parsing for real-life applications. Specifically, I begin by taking a closer look at the linguistic properties of MWEs, in particular, their lexical, syntactic, as well as semantic characteristics. The term Multiword Expressions h...

متن کامل

Indexing Corporate Memories through Ontologies

In the context of Knowledge Management, we carry out a Corporate Memories (CM) project for the Company CIRTIL. Our purpose is to focus on the modelling of the application domain. It is built as a domain ontology with a structure supporting a semantic model based on ontological relationships. In this paper we, present our S model which permits to model knowledge and to index documents. We also s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013